Test for linearity between continuous confounder and binary outcome first , run a multivariate regression analysis second
نویسندگان
چکیده
Previous statistical studies have indicated that dichotomizing a continuous confounding variable in multivariate regression analyses can lead to biased estimation of the effect of exposures, treatments, and risk factors on outcomes. We suggest that, prior to entry in the multivariate analysis, one should test whether or not the continuous confounding variable is linearly related to log-odds of the binary outcome or hazard ratios of the time-to-event binary outcome. If there is a linear relationship, we encourage that the variable not be dichotomized. We illustrate this issue using clinical data that we recently published in the New England Journal of Medicine (NEJM). The linearity assumption is tested by restricted cubic splines using SAS/Stat® procedures. INTRODUCTION In clinical studies, investigators are frequently interested in determining the association between risk factors and binary outcomes (e.g., dead or alive at discharge) or time-to-event binary outcomes (e.g., time to death, time to readmission) after adjusting for confounding variables. Continuous confounders (such as patient age, blood pressure, glucose, etc.) are often dichotomized prior to entry in the multivariate regression model (e.g., old vs. young, high vs. low blood pressure). Dichotomization is widespread in clinical studies (Del Priore 1997), as such simplicity simplifies the statistical analysis and leads to easy interpretation and presentation of the results. Dichotomization assumes that the relationship between the predictor and the outcome is flat within intervals, i.e., a discontinuity in outcome as interval boundaries are crossed, however, this assumption is far less reasonable than a linearity assumption in most cases. From a statistical point of view, dichotomization reduces statistical power, primarily due to the reduction in the inherent variability of the predictor variable. It is known that dichotomizing continuous predictor variables may result in biased estimation, either in ordinary linear regression (Cumsille et al. 2000), or in logistic regression (Becher 1992; Schulgen et al. 1994), or in Cox proportional hazards regression (Altman et al. 1994; Buettner et al. 1997; Royston et al. 2006). Furthermore, it may result in inflation of the type-I error of the risk factor (Altman et al. 1994; Austin and Brunner 2004). In this paper, we emphasize this issue based on a previous clinical study of time-to-event outcome of heart failure that we conducted (Bhatia et al. 2006). CASE STUDY: OUTCOME OF HEART FAILURE WITH PRESERVED EJECTION FRACTION We conducted a study to compare the features and outcomes of patients with heart failure with preserved ejection fraction (n=880) with those of patients with heart failure with reduced ejection fraction (n=1570). We used Cox proportional-hazards regression analysis to identify factors associated with an increased risk of death after hospitalization for heart failure. Backward variable elimination was used to create a parsimonious model for predicting mortality. Systolic blood pressure was one of the variables selected for inclusion in the final Cox model. By forcing the variable denoting ejection-fraction group into the final 1 Statistics and Data Analysis SAS Global Forum 2009
منابع مشابه
The Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data
The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...
متن کاملUsing generalized additive models to reduce residual confounding.
Traditionally, confounding by continuous variables is controlled by including a linear or categorical term in a regression model. Residual confounding occurs when the effect of the confounder on the outcome is mis-modelled. A continuous representation of a covariate was previously shown to result in a less biased estimate of the adjusted exposure effect than categorization provided the function...
متن کاملP-66: There Is No Difference between IVF/ICSI Cycle Outcome in Patients With and without PCOS; A Modified Poisson Regression Model
Background Polycystic ovary syndrome is a frequent condition in women of reproductive age with prevalence rate of 5-10%. This study purposed to determine the relationship between polycystic ovary syndrome and the outcome of assisted reproductive treatment (ART) cycle in Tehran, Iran. MaterialsAndMethods In this historical cohort study, 996 infertile women who were referred to Royan Institute (T...
متن کاملBasic concepts of statistical analysis for surgical research.
Appropriate statistical analyses are an integral part of surgical research. The purpose of this work is to assist surgeons and clinicians with the interpretation of statistics by providing a general understanding of the basic concepts that lead to choosing an appropriate statistical test for common study designs. It is extremely important to understand the nature of the data before embarking on...
متن کاملBayesian multivariate hierarchical transformation models for ROC analysis.
A Bayesian multivariate hierarchical transformation model (BMHTM) is developed for receiver operating characteristic (ROC) curve analysis based on clustered continuous diagnostic outcome data with covariates. Two special features of this model are that it incorporates non-linear monotone transformations of the outcomes and that multiple correlated outcomes may be analysed. The mean, variance, a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009